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INTROOUCTION 


The CLASSY algorlthw attempts to decompose multivariate Landsat spectral data 
as a mixture of iiwltivariate normal distributions. It is hoped that this 
information can be used to increase the accuracy of the estimates of area pro- 
portion of certain crops of interest. In order for this to be true, the mix- 
ture components of the CLASSY decomposition should each represent a spectral 
signature overwhelmingly of the crop class of interest, This note will study 
the purity of CLASSY components, and propose a Bayesian method for using that 
information to improve maximum likelihood area estimates# The method is then 
tested on ten LACIE Transition Year segments with the classifier trained by 
Analyst Interpreter labels. 
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2 . PURITY OF CUSSY COMPONENTS 


The CLASSY algorithm (Lennington and Rassbach [1979]) estimates a mixture 
decomposition of continuous multivariate data Into multivariate normal 
components. When the data consists of spectral values from LACIE segments, 
possibly from several different acquisitions at widely spaced times, It Is 
hoped that CLASSf decomposition may aid In the estimation of proportions of 
various crops present. If p^^’twre elements characteristic of a particular 
component represent only a random assortment of crop types, clearly nothing 
has been gained. The hope would be that In many cases a CLASSY component 
would represent the signature of a particular crop, or a special case of such 
a signaturei therefore, picture elements characteristic of that component 
would overwhelmingly belong to the crop of Interest. 

This latter case Is that of high component purity. We will measure purity by 
an Index 3,3*0 means that none of the component Is planted In the crop of 
Interest, 3 * .4 means that 40X of the component Is In the crop, and 3 ■ l.O 
means that all of the component corresponds to the crop. CLASSY Is most useful 
to us when the 3's tend to be near zero and one. 

The 3*s may be estimated by the method of maximum likelihood (Lennington and 
Terrell [1980]). The purity for small grains of 113 CLASSY components of 
eleven Year Three LACIE segments were estimated using the MAXLABEL program; the 
estimator was trained using ground truth crop type for approximately 
100 pixels per segment. Figure 1 shows a histogram of the S's obtained. It 
Is encouraging that a substantial majority are near zero or one. 

In the sequel It will be useful to have a mathematical description of the 
statistical distribution of the component purities. In the case of two crop 
types, a Beta distribution with density 



on (0,1) 



and zero elsewhere would be convenient* Since the mean 5 ■ y|-g and 

2 * 

variance 0, ^ ■ ■' - m - — . we may estimate a and b from the sample 

(a + b)2(a 4. b + 1) 

mean and standard deviation. In our data > «?S8, Sg ■ .389, Now 

, . b . liLjjsl . 1 

SO that a ■ (a '«■ b)ii b ■ (a b)(l - u). Substituting population values we 
got 

a ■ .187 b ■ .335 

The continuous curve In figure I Is sufficiently close to the observed 
distribution to reassure us of the plausibility of the assumption of a Beta 
distribution (It Is simply the density B.i87,,335 f*escaled for the histogram). 

» A 

The fact that a Is smaller than b Is related both to the greater purity of 
non-small -grains components and the fact that small grains elements were In 
the minority In our sample segments. When a and b were estimated for another 
population of Transition Year segments, but the estimators were trained with 
Analyst Interpreter dots, a ■ .155, b ■ .441 were obtained. These are quite 
close in practice to the previous results; they seem to reflect In addition 
the tendency of Analyst Interpreters to make more errors of omission than of 
coimlssion. 
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3, ESTIMATION OF POSTERIOR PURITY 


Uc now proceed to utilize prior Information to estimate small grains propor- 
tion. Two approaches to area estimation have been proposed: If Is the 
prior probability of component then a proportion estimate Is where 
jeP If and only If Sj > 1/2. Alternatively, a proportion estimate Is Sgjcij 
over all components (Lennington and Terrell (1980)). Notice that the two 
methods are equivalent If all consonants are pure. 


The generalization of the Beta distribution to a number of simultaneously 
Independently distributed purities Is the Oirichlet distribution. In other 
words, the posterior density of the By 1 ■ I, •••, c, j ■ I, •••, d where c 
it the number of crops and d Is the number of components Is 


that 



with the constraint 


By - 1, 0 < By 


< 1 and e^ 


> -1 


is a measure of typical prior purity of the 1^^ crop, and K Is a constant such 
that the distribution Integrates to one. 


Let , where K| » I, •••, be the spectral vectors of the set of training 

pixels labeled as crop 1. Let fj be the multinormal density of the 
component estimated by CLASSY. We will assume that the probability that a 
pixel belongs to a particular crop depends only on Its component metnbershlp, 
and not on Its spectral location within a component distribution. This simply 
means that each component Is homogeneous; If this were not the case, it would cast 
doubt on the assumption of multi normality. The likelihood of the observations 


^1K 


1 


Is then 


c 1 d 

n n z Prob {Ll. , 1, j} 

i-l j-1 ”^1 
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c d 

■ n HE Prob (ti,/ , f|j) Prob (j) 

1*1 K|-l j-l 

c d 

- n HE Prob {tu Prob {i|J) Prob 

1*1 K^-1 j*l ^'^1 

g d 

- n HE Prob it^u |j) Prob (1|j) Prob (j) 
i-1 K^-1 j-l 

c d 

■ n n E ) ® 4<*4 

1-1 K^-l j-l ^ ^*^1 ’J J 

Given the prior Information represented by the ejj’s, we get a posterior 
likelihood 


1 - K n ]( 

1-1 


d e, ^1 d 


n $.!) ( n E ) 8.. « ) 

j-l ’J K^-1 j-l J ’’^1 J 


and taking logarithms 


L - log 1 ■ log K e e< Z log 

1-1 ^ j-l 


c ^ 


E E log C z ®<04 ■I^ 4 (^ 4 |/ )3 
1-1 K^-l j-l J ^ 


j* 

We will estimate cluster purities by taking the maximum posterior likelihood 

c 

under the constraints E g.. - 1, j - 1, •••, d, using the method of Lagrange 

1-1 ’J 

multipliers. The recursive fixed-point solution was given by 


^1j -T 




E S. 
m-1 


mj 


6 


wHtrt 


* *1 * J , “3 I 

Notice the parallel betvMten this method and the classical maximum likelihood 
solution In Lennington and Terrell (1980). This solution becomes the 
classical solution by setting e^ « 0 for each 1. 

The ease with which the Dirichlet prior enters Into this solution may be 
attributed to the fact that It Is conjugate to the multinomial distribution, 
and the S's have some of the character of multinomial probabilities. 
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TABLE 1.- ESTIMATES OF SPRING SMALL GRAINS PROPORTION 


(1) 

Stgment 

(2) 

Ground 

Truth 

% 

(3) 

Maxlabel 

Stratified 

i 

(4) 

Maxlabel 

Cluster 

% 

(5) 

Empirical 

Prior 

Stratified 

i 

(6) 

Empirical 

Prior 

Cluster 

% 

1394 

35.45 

35.64 

27 M 

34.80 

27.66 

1457 

47.72 

31.20 

zs,n 

30.58 

25.71 

1518 

34.16 

26.81 

20.67 

26.00 

20.67 

1602 

30.42 

24.41 

21.79 

24.18 

21.79 

1619 

47.91 

38.32 

39.19 

38.48 

39.19 

1668 

9.49 

7.49 

6.34 

6.34 

6.34 

1825 

26.69 

22.75 

19.33 

22.63 

19.33 

1909 

22.35 

10.18 

9.34 

9.78 

9.34 

1918 

15.02 

18.54 

18.16 

18.29 

18.16 

Bias % 


-6.01 

-9.57 

-6.60 

-9.57 

Mean 

squared 

error 


.00674 

.0134 

.00746 

.0134 





4. APPLICATION TO SMALL GRAINS ESTIMATION 


The procedure of the last section was Incorporated into a FORTRAN program to 
run on the LARS system at Purdue University by modifying the MAXUbEL program 
(Horton and Lennington (1980)). Since the e^'s may be negative, It was neces- 
sary to Introduce the additional constraint that 0 < < 1. This was accom- 

plished by setting an S^j equal to zero whenever It becomes negative during 
the process of Iteration. No effect on the average rate of convergence was 
noted as a result of this change. 

Ten Transition Year LACIE segments In which spring small grains were the major 
crop of Interest were chosen because they had four good quality acquisitions 
spaced over the growing season, they had been handled by the PI procedure so 
that approximately two hundred analyst-labeled pixels were available, and 
estimates of the true proportion of small grains were available from ground 
truth surveys. The four spectral values at each acquisition were projected 
onto the Kauth-Thomas (1976) greenness-brightness plane; thus, there were 
eight components of the spectral vector for each pixel. 

The CLASSY program was run for each of the ten segments listed In column one 
giving a decomposition into mixtures of eight dimensional multi normal compo- 
nents for each segment. The program described above (called PRELABEL) was 
then run for the segments with several sets of values of the prior purities. 
The results are summarized in Table 1. For the third and fourth columns, all 
e<'s were set to zero, giving the equivalent of a maximum likelihood solution. 

* A A 

For columns five and six, e^ * b - 1 * -.665 and e£ * a - 1 ■ -.813 were used 

as these are the empirical values found in section 11. Column two is the 

ground truth proportion of spring small grains for each segment. At the foot 

of each column are the mean error in percent (called bias) and the mean 

squared error of each estimate. Columns two and four use the estimator 
d 

E and columns three and five use E o. where jePi, if and only if 

j-1 J jcp J ^ 

> 1/2. It is clear that the introduction of prior purities made no 

difference to the second estimates and very little difference to the first. A 
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further experiment In whieh the *il1ffute‘^ prior ij * -I for all 1 was used 
made even less difference. The results still reflect the analyst tendency to 
prefer errors of omission to errors of commissioni 
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3 . COWaUSIONS 


A notable result of this study was the success In fitting a Beta distribution 
to maximum likelihood estimates of component purity. The fact that 3 and 6 
are substantially less than one Indicates that CLASSY components show more 
than chance tendency to achieve high degrees of purity In crops of Interest. 
This Is the bulk of the evidence In existence that CLASSY actually extracts 
features of importance from Landsat data. 

On the other hand, the methods studied for Introducing the prior Information 
Into the process of estimation make little difference to the results when used 
with AI labeled samples, and are not recommended for Incorporation Into 
practical estimation procedures. 
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